Overview

Dataset Statistics

Number of Variables 27
Number of Rows 3000
Missing Cells 0
Missing Cells (%) 0.0%
Duplicate Rows 0
Duplicate Rows (%) 0.0%
Total Size in Memory 3.4 MB
Average Row Size in Memory 1.2 KB
Variable Types
  • Numerical: 8
  • Categorical: 19

Dataset Insights

NumOpioidPrescriptions and NumHealthcareVisits have similar distributions Similar Distribution
DurationOfPrescriptions and Duration have similar distributions Similar Distribution
PrescriptionDate has a high cardinality: 1355 distinct values High Cardinality
TimeofAppointment has a high cardinality: 2945 distinct values High Cardinality
TimeSeenbyPhysician has a high cardinality: 2935 distinct values High Cardinality
ZipCode has constant length 5 Constant Length
NumHospitalizations has constant length 1 Constant Length
PrescriptionDate has constant length 10 Constant Length
Refills has constant length 1 Constant Length
TimeofAppointment has constant length 8 Constant Length
TimeSeenbyPhysician has constant length 8 Constant Length
NumHealthcareVisits has 164 (5.47%) zeros Zeros
  • 1
  • 2

Variables


PatientID

numerical

Approximate Distinct Count 3000
Approximate Unique (%) 100.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 48000
Mean 4.9031e+12
Minimum 127765011
Maximum 9995945604714
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • PatientID is skewed right (γ1 = 0.0481)

Quantile Statistics

Minimum 127765011
5-th Percentile 5.2104e+11
Q1 2.4519e+12
Median 4.8298e+12
Q3 7.3471e+12
95-th Percentile 9.4439e+12
Maximum 9995945604714
Range 9995817839703
IQR 4.8951e+12

Descriptive Statistics

Mean 4.9031e+12
Standard Deviation 2.8598e+12
Variance 8.1782e+24
Sum 1.4709e+16
Skewness 0.04815
Kurtosis -1.1847
Coefficient of Variation 0.5833

Age

numerical

Approximate Distinct Count 62
Approximate Unique (%) 2.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 48000
Mean 48.2457
Minimum 18
Maximum 79
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Age is skewed right (γ1 = 0.031)

Quantile Statistics

Minimum 18
5-th Percentile 21
Q1 33
Median 48
Q3 64
95-th Percentile 77
Maximum 79
Range 61
IQR 31

Descriptive Statistics

Mean 48.2457
Standard Deviation 17.867
Variance 319.2311
Sum 144737
Skewness 0.03098
Kurtosis -1.1938
Coefficient of Variation 0.3703

Gender

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 209912

Length

Mean 4.9707
Standard Deviation 0.9997
Median 4
Minimum 4
Maximum 6

Sample

1st row Female
2nd row Female
3rd row Female
4th row Male
5th row Male

Letter

Count 14912
Lowercase Letter 11912
Space Separator 0
Uppercase Letter 3000
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Male, Female) take over 50.0%

Race

categorical

Approximate Distinct Count 5
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Memory Size 211836

Length

Mean 5.612
Standard Deviation 1.2091
Median 5
Minimum 5
Maximum 8

Sample

1st row Other
2nd row Asian
3rd row White
4th row White
5th row White

Letter

Count 16836
Lowercase Letter 13836
Space Separator 0
Uppercase Letter 3000
Dash Punctuation 0
Decimal Number 0

ZipCode

categorical

Approximate Distinct Count 5
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Memory Size 210000

Length

Mean 5
Standard Deviation 0
Median 5
Minimum 5
Maximum 5

Sample

1st row 73301
2nd row 60601
3rd row 90210
4th row 10001
5th row 10001

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 15000
  • ZipCode has words of constant length

ChronicPainConditions

categorical

Approximate Distinct Count 5
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Memory Size 234429

Length

Mean 13.143
Standard Deviation 3.2147
Median 12
Minimum 9
Maximum 17

Sample

1st row Fibromyalgia
2nd row Cancer Pain
3rd row Fibromyalgia
4th row Fibromyalgia
5th row Post-Surgery Pain

Letter

Count 36486
Lowercase Letter 30543
Space Separator 2360
Uppercase Letter 5943
Dash Punctuation 583
Decimal Number 0
  • The largest value (pain) is over 2.82 times larger than the second largest value (fibromyalgia)

NumOpioidPrescriptions

numerical

Approximate Distinct Count 19
Approximate Unique (%) 0.6%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 48000
Mean 10.0257
Minimum 1
Maximum 19
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • NumOpioidPrescriptions is skewed right (γ1 = 0.0116)

Quantile Statistics

Minimum 1
5-th Percentile 2
Q1 5
Median 10
Q3 15
95-th Percentile 18
Maximum 19
Range 18
IQR 10

Descriptive Statistics

Mean 10.0257
Standard Deviation 5.3812
Variance 28.9573
Sum 30077
Skewness 0.01164
Kurtosis -1.1767
Coefficient of Variation 0.5367
  • NumOpioidPrescriptions is not normally distributed (p-value 1.2433396950261507e-23)

AverageDosage

numerical

Approximate Distinct Count 95
Approximate Unique (%) 3.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 48000
Mean 52.089
Minimum 5
Maximum 99
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • AverageDosage is skewed right (γ1 = 0.0344)

Quantile Statistics

Minimum 5
5-th Percentile 10
Q1 29
Median 52
Q3 76
95-th Percentile 95
Maximum 99
Range 94
IQR 47

Descriptive Statistics

Mean 52.089
Standard Deviation 27.3922
Variance 750.3319
Sum 156267
Skewness 0.03441
Kurtosis -1.2205
Coefficient of Variation 0.5259
  • AverageDosage is not normally distributed (p-value 0.0008336156827187905)

DurationOfPrescriptions

numerical

Approximate Distinct Count 29
Approximate Unique (%) 1.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 48000
Mean 14.974
Minimum 1
Maximum 29
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • DurationOfPrescriptions is skewed right (γ1 = 0.0095)

Quantile Statistics

Minimum 1
5-th Percentile 2
Q1 8
Median 15
Q3 22
95-th Percentile 28
Maximum 29
Range 28
IQR 14

Descriptive Statistics

Mean 14.974
Standard Deviation 8.3232
Variance 69.2757
Sum 44922
Skewness 0.00955
Kurtosis -1.1683
Coefficient of Variation 0.5558
  • DurationOfPrescriptions is not normally distributed (p-value 0.0)

NumHealthcareVisits

numerical

Approximate Distinct Count 20
Approximate Unique (%) 0.7%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 48000
Mean 9.5823
Minimum 0
Maximum 19
Zeros 164
Zeros (%) 5.5%
Negatives 0
Negatives (%) 0.0%
  • NumHealthcareVisits is skewed left (γ1 = -0.0298)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 4
Median 10
Q3 15
95-th Percentile 19
Maximum 19
Range 19
IQR 11

Descriptive Statistics

Mean 9.5823
Standard Deviation 5.769
Variance 33.2816
Sum 28747
Skewness -0.0298
Kurtosis -1.1987
Coefficient of Variation 0.602
  • NumHealthcareVisits is not normally distributed (p-value 4.2235679200666696e-70)

NumHospitalizations

categorical

Approximate Distinct Count 5
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Memory Size 198000

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 2
4th row 2
5th row 3

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 3000
  • NumHospitalizations has words of constant length

PainManagementTreatment

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 201589
  • The largest value (No) is over 4.09 times larger than the second largest value (Yes)

Length

Mean 2.1963
Standard Deviation 0.3973
Median 2
Minimum 2
Maximum 3

Sample

1st row No
2nd row Yes
3rd row No
4th row No
5th row Yes

Letter

Count 6589
Lowercase Letter 3589
Space Separator 0
Uppercase Letter 3000
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (No, Yes) take over 50.0%
  • The largest value (no) is over 4.09 times larger than the second largest value (yes)

PrescriptionDate

categorical

Approximate Distinct Count 1355
Approximate Unique (%) 45.2%
Missing 0
Missing (%) 0.0%
Memory Size 225000

Length

Mean 10
Standard Deviation 0
Median 10
Minimum 10
Maximum 10

Sample

1st row 2023-12-06
2nd row 2022-02-21
3rd row 2023-05-01
4th row 2020-03-04
5th row 2020-09-19

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 6000
Decimal Number 24000
  • PrescriptionDate has words of constant length

MedicationName

categorical

Approximate Distinct Count 12
Approximate Unique (%) 0.4%
Missing 0
Missing (%) 0.0%
Memory Size 224070

Length

Mean 9.69
Standard Deviation 1.8467
Median 9
Minimum 7
Maximum 13

Sample

1st row Hydrocodone
2nd row Hydromorphone
3rd row Hydrocodone
4th row Oxymorphone
5th row Tramadol

Letter

Count 29070
Lowercase Letter 26070
Space Separator 0
Uppercase Letter 3000
Dash Punctuation 0
Decimal Number 0

Dosage

categorical

Approximate Distinct Count 12
Approximate Unique (%) 0.4%
Missing 0
Missing (%) 0.0%
Memory Size 216140
  • The largest value (10 mg) is over 1.84 times larger than the second largest value (80 mg)

Length

Mean 7.0467
Standard Deviation 3.1444
Median 5
Minimum 4
Maximum 13

Sample

1st row 10 mg
2nd row 100 mcg/hour
3rd row 20 mg
4th row 10 mg
5th row 80 mg

Letter

Count 10560
Lowercase Letter 10560
Space Separator 3000
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 6219
  • The largest value (mg) is over 2.29 times larger than the second largest value (mcghour)

Frequency

categorical

Approximate Distinct Count 4
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 233916

Length

Mean 12.972
Standard Deviation 1.8704
Median 13
Minimum 10
Maximum 15

Sample

1st row every 8 hours
2nd row every 8 hours
3rd row every 4-6 hours
4th row every 12 hours
5th row every 4-6 hours

Letter

Count 29239
Lowercase Letter 29239
Space Separator 5239
Uppercase Letter 0
Dash Punctuation 723
Decimal Number 3715
  • The top 2 categories (every 8 hours, once daily) take over 50.0%

Duration

numerical

Approximate Distinct Count 29
Approximate Unique (%) 1.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 48000
Mean 15.041
Minimum 1
Maximum 29
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Duration is skewed left (γ1 = -0.0033)

Quantile Statistics

Minimum 1
5-th Percentile 2
Q1 8
Median 15
Q3 22
95-th Percentile 28
Maximum 29
Range 28
IQR 14

Descriptive Statistics

Mean 15.041
Standard Deviation 8.3515
Variance 69.7479
Sum 45123
Skewness -0.003272
Kurtosis -1.2053
Coefficient of Variation 0.5553
  • Duration is not normally distributed (p-value 0.0)

Refills

categorical

Approximate Distinct Count 5
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Memory Size 198000

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 2
4th row 0
5th row 2

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 3000
  • Refills has words of constant length

MedicationClass

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 218115

Length

Mean 7.705
Standard Deviation 1.2379
Median 8
Minimum 6
Maximum 9

Sample

1st row Opioid
2nd row Narcotic
3rd row Analgesic
4th row Analgesic
5th row Analgesic

Letter

Count 23115
Lowercase Letter 20115
Space Separator 0
Uppercase Letter 3000
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Analgesic, Narcotic) take over 50.0%

Adherence

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 209972

Length

Mean 4.9907
Standard Deviation 2.1532
Median 4
Minimum 3
Maximum 8

Sample

1st row Moderate
2nd row Low
3rd row Moderate
4th row Low
5th row High

Letter

Count 14972
Lowercase Letter 11972
Space Separator 0
Uppercase Letter 3000
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (High, Low) take over 50.0%

ClinicalNotes

categorical

Approximate Distinct Count 11
Approximate Unique (%) 0.4%
Missing 0
Missing (%) 0.0%
Memory Size 350947

Length

Mean 51.9823
Standard Deviation 8.7039
Median 54
Minimum 39
Maximum 66

Sample

1st row Post-operative pai...
2nd row Prescribed Oxymorp...
3rd row Patient reports ef...
4th row Using Tramadol for...
5th row Patient reports ef...

Letter

Count 133900
Lowercase Letter 128164
Space Separator 16854
Uppercase Letter 5736
Dash Punctuation 561
Decimal Number 0
  • The largest value (pain) is over 1.79 times larger than the second largest value (prescribed)

Specialty

categorical

Approximate Distinct Count 4
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 229593

Length

Mean 11.531
Standard Deviation 2.5034
Median 12
Minimum 8
Maximum 15

Sample

1st row Orthopedics
2nd row Pain Management
3rd row Oncology
4th row Orthopedics
5th row Oncology

Letter

Count 33078
Lowercase Letter 28563
Space Separator 1515
Uppercase Letter 4515
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Pain Management, Primary Care) take over 50.0%

AppointmentType

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 231707

Length

Mean 12.2357
Standard Deviation 2.8176
Median 12
Minimum 9
Maximum 16

Sample

1st row Routine Check-up
2nd row Consultation
3rd row Routine Check-up
4th row Consultation
5th row Follow-up

Letter

Count 33836
Lowercase Letter 29904
Space Separator 932
Uppercase Letter 3932
Dash Punctuation 1939
Decimal Number 0
  • The top 2 categories (Consultation, Follow-up) take over 50.0%

SubSpecialty

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 222128

Length

Mean 9.0427
Standard Deviation 1.9999
Median 11
Minimum 7
Maximum 11

Sample

1st row Specialized
2nd row General
3rd row Specialized
4th row Specialized
5th row General

Letter

Count 27128
Lowercase Letter 24128
Space Separator 0
Uppercase Letter 3000
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Specialized, General) take over 50.0%

TimeofAppointment

categorical

Approximate Distinct Count 2945
Approximate Unique (%) 98.2%
Missing 0
Missing (%) 0.0%
Memory Size 219000

Length

Mean 8
Standard Deviation 0
Median 8
Minimum 8
Maximum 8

Sample

1st row 13:39:02
2nd row 19:44:46
3rd row 18:05:34
4th row 15:16:11
5th row 22:17:05

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 18000
  • TimeofAppointment has words of constant length

TimeSeenbyPhysician

categorical

Approximate Distinct Count 2935
Approximate Unique (%) 97.8%
Missing 0
Missing (%) 0.0%
Memory Size 219000

Length

Mean 8
Standard Deviation 0
Median 8
Minimum 8
Maximum 8

Sample

1st row 16:33:59
2nd row 17:33:37
3rd row 04:39:28
4th row 15:52:51
5th row 02:17:33

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 18000
  • TimeSeenbyPhysician has words of constant length

TotalTimeSpentwithPhysician

numerical

Approximate Distinct Count 50
Approximate Unique (%) 1.7%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 48000
Mean 35.008
Minimum 10
Maximum 59
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • TotalTimeSpentwithPhysician is skewed left (γ1 = -0.0369)

Quantile Statistics

Minimum 10
5-th Percentile 12
Q1 23
Median 35
Q3 47
95-th Percentile 57
Maximum 59
Range 49
IQR 24

Descriptive Statistics

Mean 35.008
Standard Deviation 14.2716
Variance 203.6792
Sum 105024
Skewness -0.03688
Kurtosis -1.1781
Coefficient of Variation 0.4077

Interactions

Correlations

Missing Values